Fitting Com-Poisson Mixtures to Bimodal Count Data
نویسندگان
چکیده
Bi-modal truncated count distributions are frequently observed in aggregate surveys and ratings when respondents are mixed in their opinion. They also arise in censored count data, where the highest category might create an additional mode. The Poisson distribution is the most common distribution for fitting count data and can be modified to achieve mixtures of truncated Poisson distributions. However, it is suitable only for modeling equi-dispersed distributions and is limited in its ability to capture bi-modality. The Conway-Maxwell-Poisson (CMP) distribution is a two-parameter generalization of the Poisson distribution that allows for overand under-dispersion. While the CMP is much more flexible, it still cannot capture bi-modality. In this work, we propose a mixture of CMPs for capturing a wide range of truncated count data, which can exhibit unimodal behavior (with equi-, underor over-dispersion) as well as bimodal behaviour. We present methods for estimating the parameters of a mixture of two CMP distributions using an EM approach. Our approach introduces a special two-step optimization within the M-step to estimate multiple parameters. The methods are illustrated using simulated and real data.
منابع مشابه
A Flexible Regression Model for Count Data
Poisson regression is a popular tool for modeling count data and is applied in a vast array of applications from the social to the physical sciences and beyond. Real data, however, are often overor under-dispersed and, thus, not conducive to Poisson regression. We propose a regression model based on the Conway–Maxwell-Poisson (COM-Poisson) distribution to address this problem. The COM-Poisson r...
متن کاملCharacterizing the performance of the Conway-Maxwell Poisson generalized linear model.
Count data are pervasive in many areas of risk analysis; deaths, adverse health outcomes, infrastructure system failures, and traffic accidents are all recorded as count events, for example. Risk analysts often wish to estimate the probability distribution for the number of discrete events as part of doing a risk assessment. Traditional count data regression models of the type often used in ris...
متن کاملFitting of Count Time Series Models on the Number of Patients Referred to Addiction Treatment Centers in Semnan County
Abstract. Count data over time are observed in many application areas. Many researchers use time series patterns to analyze this data. In this paper, the poisson count time series linear models and negative binomials on this type of data with the explanatory variables are studied. The Likelihood analysis and the evaluation of count time series model based on generalized linear models are pres...
متن کاملExtended Conway-Maxwell-Poisson distribution and its properties and applications
A new four parameter extended Conway-Maxwell-Poisson (ECOMP) distribution which unifies the recently proposed COM-Poisson type negative binomial (COM-NB) distribution [Chakraborty, S. and Ong, S. H. (2014): A COM-type Generalization of the Negative Binomial Distribution, Accepted in Communications in Statistics-Theory and Methods] and the generalized COM-Poisson (GCOMP) distribution [Imoto, T. ...
متن کاملCharacterizing the Performance of the Bayesian Conway-maxwell Poisson Generalized Linear Model
This paper documents the performance of a Bayesian Conway-Maxwell-Poisson (COM-Poisson) generalized linear model (GLM). This distribution was originally developed as an extension of the Poisson distribution in 1962 and has a unique characteristic, in that it can handle both under-dispersed and over-dispersed count data. Previous work by the authors lead to the development of a dual-link GLM bas...
متن کامل